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Abstract 

A simultaneous measurement of the top-quark, W-boson, and neutrino masses is re- 
ported for tt events selected in the dilepton final state from a data sample correspond- 
ing to an integrated luminosity of 5.0 fb~ collected by the CMS experiment in pp col- 
lisions at y/s = 7 TeV. The analysis is based on endpoint determinations in kinematic 
distributions. When the neutrino and W-boson masses are constrained to their world- 
average values, a top-quark mass value of Mt = 173.9 ± 0.9 (stat.)!^ o ( s Y s t-) GeV is 
obtained. When such constraints are not used, the three particle masses are obtained 
in a simultaneous fit. In this unconstrained mode the study serves as a test of mass 
determination methods that may be used in beyond standard model physics scenar- 
ios where several masses in a decay chain may be unknown and undetected particles 
lead to underconstrained kinematics. 
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1 Introduction 

The determination of the top-quark mass sets a fundamental benchmark for the standard model 
(SM), and is one of the precision measurements that defines electroweak constraints on pos- 
sible new physics beyond the SM [1J. With the recent observations [|2l [3) of a Higgs boson 
candidate at a mass of approximately 125 GeV, existing data can now overconstrain the SM. 
The top quark plays an important role in such constraints because its large mass, appearing 
quadratically in loop corrections to many SM observables, dominates other contributions. It 
is also key to the quartic term in the Higgs potential at high energy and therefore to the 
question of stability of the electroweak vacuum flU [5]. For these reasons, precise top-quark 
mass determinations are essential to characterize and probe the SM. Recent results obtained at 
the Large Hadron Collider (LHC) for the top-quark mass in tt events include those reported 
by ATLAS 0, M t = 174.5 ± 0.6 (stat.) ± 2.3 (syst.) GeV, and by the Compact Muon Solenoid 
(CMS) 0, M t = 173.49 ± 0.43 (stat.) ± 0.98 (syst.) GeV, using the semileptonic decay chan- 
nel of the tt pair. The CMS Collaboration has also reported a measurement [8J in the dilep- 
ton channel, Mt = 172.5 ± 0.4 (stat.) ± 1.5 (syst.) GeV. A recent summary of top-quark mass 
measurements conducted by the CDF and DO Collaborations Q reports a combined result 
M t = 173.18 ± 0.56 (stat.) ± 0.75 (syst.) GeV. 

In parallel with recent measurements of the properties of the top quark at the LHC, there has 
been a great deal of theoretical progress on methods using endpoints of kinematic variables 
to measure particle masses with minimal input from simulation. These methods are generally 
aimed at measuring the masses of new particles, should they be discovered, but can also be 
applied to measure the masses of standard model particles such as the top quark. Such an ap- 
plication acts as both a test of the methods and a measurement of the top-quark mass utilizing 
technique very different from those used in previous studies. 

Indeed, top-quark pair production provides a good match to these new methods, as dilepton 
decays of top-quark pairs (tt — > (b£ + v) (b£~v)) provide challenges in mass measurement very 
similar to the ones that these methods were designed to solve. A key feature of many current 
theories of physics beyond the standard model is the existence of a candidate for dark matter, 
such as a weakly interacting massive particle (WIMP). These particles are usually stabilized 
in a theory by a conserved parity, often introduced ad hoc, under which SM particles are even 
and new-physics particles are odd. Examples include R-parity in supersymmetry (SUSY) and 
T-parity in little-Higgs models. One consequence of this parity is that new physics particles 
must be produced in pairs. Each of the pair-produced particles will then decay to a cascade of 
SM particles, terminating with the lightest odd-parity particle of the new theory. In such cases, 
there will be two particles which do not interact with the detector, yielding events where the 
observable kinematics are underconstrained. Mass measurements in these events are further 
complicated by the presence of multiple new particles with unknown masses. 

The dilepton decays of tt events at the LHC offer a rich source of symmetric decay chains termi- 
nating in two neutrinos. With their combination of jets, leptons, and undetected particles, these 
tt events bear close kinematic and topological resemblance to new-physics scenarios such as the 
supersymmetric decay chain illustrated in Fig. |l.l| This correspondence has motivated [10J the 
idea to use the abundant tt samples of the LHC as a testbed for the new methods and novel 
observables that have been proposed to handle mass measurement in new-physics events |[TT|. 
A simultaneous measurement of the top-quark, W-boson, and neutrino masses in dilepton tt 
decays closely mimics the strategies needed for studies of new physics. 

The analysis presented here focuses on the Mt2 variable and its variants |TT1IT2"I . These kine- 
matic observables are mass estimators that will be defined in Sec. |4J The goals of this analysis 
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Figure 1.1: Top-quark pair dilepton decays, with two jets, two leptons, and two unobserved 
particles (left) exhibit a signature similar to some SUSY modes (right). In the figure, u, % , 
v, and x° denote the u-squark, chargino, sneutrino, and neutralino respectively; an asterisk 
indicates the antiparticle of the corresponding SUSY particle. 

are two-fold: to demonstrate the performance of a new mass measurement technique, and 
to make a precise measurement of the top-quark mass. To demonstrate the performance of 
the method, we apply it to the tt system assuming no knowledge of the W-boson or neutrino 
masses. This allows us to measure the masses of all three undetected particles involved in the 
dilepton decay: the top quark, W boson, and neutrino. This "unconstrained" fit provides a test 
of the method under conditions similar to what one might expect to find when attempting to 
measure the masses of new particles. In order to make a precise measurement of the top-quark 
mass, on the other hand, we assume the world-average values for the W boson and neutrino 
masses. This "doubly-constrained" fit achieves a precision in the top-quark mass determina- 
tion similar to that obtained by traditional methods. 

In considering any top-quark mass measurement, however, it is critical to confront the fact that 
deep theoretical problems complicate the interpretation of the measurement. The issues arise 
because a top quark is a colored object while the W boson and hadronic jet observed in the 
final state are not. In the transition t — > Wb, a single color charge must come from elsewhere 
to neutralize the final-state b jet, with the inevitable consequence that the observed energy and 
momentum of the final state differ from that of the original top quark. The resulting difference 
between measured mass and top-quark mass is therefore at least at the level at which soft 
color exchanges occur, i.e. ~Aqcd fl3lfl4l - In the current state of the art, a Monte Carlo (MC) 
generator is normally used to fix a relationship between the experimentally measured mass of 
the final state and a top-quark mass parameter of the simulation; but model assumptions upon 
which the simulation of nonperturbative physics depend further limit the precision of such 
interpretative statements to about 1 GeV ||T5l . 

We therefore take care in this measurement to distinguish between the interpretive use of MC 
simulation described above, which is inherently model dependent, and experimental proce- 
dures, which can be made clear and model independent. A distinctive feature of the top-quark 
mass measurement reported here is its limited dependence on MC simulation. There is no 
reliance on MC templates, and the endpoint method gives a result which is consistent with 
the kinematic mass in MC without further tuning or correction. For this reason, the measure- 
ment outlined here complements the set of conventional top-quark mass measurements, and is 
applicable to new-physics scenarios where MC simulation is used sparingly. 



2 The CMS Detector and Event Reconstruction 

The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal di- 
ameter, providing a magnetic field of 3.8 T. Inside the superconducting solenoid volume are 
silicon pixel and strip trackers, a lead tungstate crystal electromagnetic calorimeter, and a 



brass /scintillator hadron calorimeter. Muons are measured in gas-ionization detectors em- 
bedded in the steel flux return yoke. Extensive forward calorimetry complements the coverage 
provided by the barrel and endcap detectors. A more detailed description of the CMS detector 
can be found in Ref. 1 16 1. 

Jets, electrons, muons, and missing transverse momentum are reconstructed using a global 
event reconstruction technique, also called particle-flow event reconstruction [17U18| . Hadronic 
jets are clustered from the reconstructed particles with the infrared and collinear-safe anti-fcx 
algorithm [19|, using a size parameter 0.5. The jet momentum is determined as the vecto- 
rial sum of all particle momenta in this jet, and is found in the simulation to be within 5% to 
10% of the true momentum over the whole transverse momentum (px) spectrum and detector 
acceptance. Jet energy corrections are derived from the simulation, and are confirmed in mea- 
surements on data with the energy balance of dijet and photon+jet events [20J. The jet energy 
resolution amounts typically to 15% at jet p T of 10 GeV, 8% at 100 GeV, and 4% at 1 TeV. The 
missing transverse momentum vector is defined by px = — Ei Pt where the sum is taken over 
all particle-flow objects in the event; and missing transverse "energy" is given by E™ ss = |px |. 

3 Event Selection 

The data set used for this analysis corresponds to an integrated luminosity of 5.0 fb~ of proton- 
proton collisions at a/s = 7 TeV recorded by the CMS detector in 2011. We apply an event 
selection to isolate a dilepton sample that is largely free of backgrounds. We require two well- 
identified and isolated opposite-sign leptons (electrons or muons) passing dilepton trigger re- 
quirements; the minimum px requirements for the triggers are 17 GeV and 8 GeV for the leading 
and sub-leading leptons. In addition we require at least two b-tagged jets, subsequently used in 
the top-quark reconstruction, and missing transverse energy. Here and throughout this paper, 
we use £ (and "lepton") to denote an electron or muon; the signal decays of interest are t — > b£v. 
Leptons must satisfy px > 20 GeV and the event is vetoed if the leptons have the same flavor 
and their dilepton invariant mass is within 15 GeV of the Z boson mass. If three leptons are 
found, the two highest-px leptons forming an opposite-sign pair are selected. Jets must satisfy 
px > 30 GeV after correcting for additive effects of pileup (multiple proton collisions in a single 
crossing) and multiplicative effects of jet energy scale calibration. Jets are further required to 
lie within \tj\ < 2.5, where rj is the pseudorapidity variable, rj = — ln[tan(0/2)]. The b-tagging 
algorithm is the Combined Secondary Vertex (CSV) tagger of Ref. [21], deployed here with an 
operating point that yields a tagging efficiency of 62% on a b-enriched multijet sample, and 
b-jet purity of 91% in the subsample passing all selection requirements of this analysis. Jet 
masses are required to satisfy a very loose requirement m- ]ei < 40 GeV to assure the existence of 
kinematic solutions and reject poorly reconstructed jets. The missing transverse energy must 
satisfy E^ 1SS > 30 GeV for e + e~ and p + p~ events and E™ ss > 20 GeV for e ± p. T events, where 
Drell-Yan backgrounds are smaller. With the exception of the b-tagging criteria and the b-jet 
mass requirement, all selection requirements summarized here are discussed in more detail in 
p2ll23l . The sample of events in data meeting all of the signal selection criteria contains 8700 
events. 



4 Kinematic Variables 

The endpoint method of mass extraction is based on several variables that are designed for 
use in the kinematically complex environment of events with two cascade decays, each end- 
ing in an invisible particle. The challenge here is two-fold, combining the complications of a 



4 Kinematic Variables 



many-body decay with the limitations of an underconstrained system. In a two-body decay 
A — > B C, the momentum of either daughter in the parent rest frame exhibits a simple and 
direct relationship to the parent mass. In a three-body decay A — > BCD, the relationship is 
less direct, encoded not in a delta function of momentum but in the kinematic boundary of the 
daughters' phase space. In general, the parent mass may be determined from the endpoints of 
the observable daughter momenta in the parent rest frame. To carry out this program, however, 
the daughter masses must be known and enough of the momenta be measurable or constrained 
by conservation laws to solve the kinematic equations. 

Applying this program to the measurement of the top-quark mass in the decay t — > b£v, one 
immediately encounters a number of obstacles. At a hadron collider, the tt system is produced 
with unknown center-of-mass energy and has an event-dependent pj-boost due to recoil from 
the initial-state radiation (ISR). Furthermore, in pp collisions we can apply constraints of mo- 
mentum conservation only in the two dimensions transverse to the beam direction. Since top 
quarks are normally produced in pairs, the individual neutrino momenta are indeterminate, 
adding further complication. These obstacles seem daunting but can be overcome by the use 
of "designer" kinematic variables Mj2 U2j and Mqt [24], which, by construction, address pre- 
cisely these issues. In this paper we use Mj2- Because the transverse momentum of the tt 
system varies from event to event, the py-insensitive version |25j[26l, Mj2±, is particularly use- 
ful. To measure the masses of the top-quark, W-boson, and neutrino, we measure the endpoints 
of three kinematic distributions, Uu, pt,b/ an d Mb£, as discussed in the following subsections. 

4.1 M T 2 and Subsystem Variables 

4.1 .1 The Mt2 Observable 

The variable Mj2 is based on the transverse mass, Mj, which was first introduced to measure 
the W-boson mass in the decay W — > i.v. In this case, Mj is defined by 

M^ = m 2 v + m 2 ( + 2(E v T E^-p v T -p f j). (4.1) 

The observable Mj represents the smallest mass the W boson could have and still give rise 
to the observed transverse momenta p T and p T = pi • The utility of Mj lies in the fact that 
Mj < Mw is guaranteed for W bosons with low transverse momentum. For a single W — > Iv 
decay such a lower limit is only marginally informative, but in an ensemble of events, the 
maximum value achieved, i.e. the endpoint of the Mj distribution, directly reveals the W boson 
mass. This observation suggests a "min-max" strategy which is generalized by the invention 

Of Mj2- 

The Mj2 observable is useful for finding the minimum parent mass that is consistent with ob- 
served kinematics when two identical decay chains a and b each terminate in a missing particle. 



Figure 1.1 shows both a SM and a new physics example. If one knew the two missing trans- 
verse momenta separately, a value of Mj could be calculated for either or both of the twin 
decay chains and the parent mass M would satisfy the relationship max(M|, Mj) < M. In 
practice the two missing momenta cannot be known separately, and are observable only in the 
combination p| + p j = ^t • This compels one to consider all possible partitions of pj into two 
hypothetical constituents p| and p T , evaluating within this ensemble of partitions the minimum 
parent mass M consistent with the observed event kinematics. With this extension of the Mj 
concept, the variable is now called Mj2- 

M T2 = min (max(Mf,M^)| . (4.2) 

p!+Pt=P't l j 



4.1 M T2 and Subsystem Variables 5 

As with Mt, the endpoint of the Mj2 distribution has a quantifiable relationship to the parent 
mass, and the endpoint of an Mx2 distribution is therefore a measure of the unseen parent mass 
in events with two identical decay chains. 



The observable Mj2 requires some care in its use. The presence of Ej = a /p T + m 2 in Eq. 4.1 
implies that one must either know (as in the case of W — > Iv) or assume (as in the case of 
unknown new physics) a value of the mass m of the undetected particle(s). In this paper we 
will refer to an assumed mass as the "test mass" and distinguish it with a tilde (i.e. in); the actual 
mass of the missing particle, whether known or not, will be referred to as the "true mass", and 
written without the tilde. Both the value of Mj2 in any event and the value of the endpoint of 
the Mj2 distribution in an ensemble of events are in the end functions of the test mass. 

Even when a test mass has been chosen, however, the endpoint of the Mj2 distribution may 
not be unique because it is in general sensitive to transverse momentum Pj = |Px| of the 
underlying two-parent system, which varies from event to event. The sensitivity vanishes if 
the test mass can be set equal to the true mass, but such an option will not be immediately 
available in a study of new physics where the true mass is not known. 

The Pj problem is instead addressed by introducing Mj2± |25ll26| , which uses only momentum 
components transverse to the Pj boost direction. In this way, Mj2± achieves invariance under 
Pj boosts of the underlying two-parent system. The construction of Mj2± is identical to that of 



Mj2 except that pj values that appear explicitly or implicitly in Eq. 4.1 are everywhere replaced 
by px^ values, where pxi_ is defined to be the component of px in the direction perpendicular 
to the Px of the two-parent system. Formally, 

p T ± = n T x (p T x n T ) , (4.3) 

where fix = Px/ |Pt| is the unit vector parallel to the transverse momentum of the two-parent 
system. 

4.1 .2 Subsystem Variables 

A further investigation of Mx2 and Mj2± reveals the full range of kinematic information con- 
tained in multistep decay chains by splitting and grouping the elements of the decay chain in 
independent ways. 

The Mx2 variable classifies the particles in an event into three categories: "upstream", "visible", 
and "child". The child particles are those at the end of the decay chain that are unobservable or 
simply treated as unobservable; the visible particles are those whose transverse momenta are 
measured and used in the calculations; and the upstream particles are those from further up 
the decay chain, including any ISR accompanying the hard collision. 

In general, the child, visible, and upstream objects may actually be collections of objects, and 
the subsystem observables introduced in Ref. [flOl parcel out the kinematic information in as 



many independent groupings as possible. Figure 4.1 shows two of the three possible ways 
of classifying the tt daughters for Mx2 calculations. The Uu variable, known as Mj 1 ®^ in Ref. 
ITUl uses the two leptons of the tt dilepton decays, treating the neutrinos as lost child particles 
(which they are), and combining the b jets with all other "upstream" momentum in the event. 
The f/bb variable, known as M T |\_ in Ref. [lOl uses the b jets, and treats the W bosons as lost 
child particles (ignoring the fact that their charged daughter leptons are in fact observable). It 
considers only ISR jets as generators of upstream momentum. 

For completeness, we note that a third Mj2± subsystem can be constructed by combining the b 
jet and the lepton as a single visible system. This variable, known as MSj_ in the nomenclature 
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Figure 4.1: A tt dilepton decay with the two subsystems for computing y.^ and ^b indicated. 
The "upstream" and "child" objects are enclosed in dashed rectangles, while the visible objects, 
which enter into the computation, are enclosed in solid rectangles. The jiu an d Hbb variables 



used here are identical to M|^j_ and Mj 2 }, of Ref. 



of Ref. llTOl . exhibits significant correlation with M\,£, the invariant mass of the b jet and lepton. 
A third observable is needed to solve the underlying system of equations, and for this we 
choose M^. 



4.2 Observables Used in this Analysis 

This analysis is based on two Mj2± variables, uu and ^,b as described above, and one invariant 
mass, M\,i, the invariant mass of a b jet and lepton from the same top-quark decay. These three 
quantities have been selected from a larger set of possibilities based on the low correlation we 
observe among them and the generally favorable shapes of the distributions in their endpoint 
regions. The observables can be summarized by the underlying kinematics from which they 
are derived, and the endpoint relations which include the top-quark, W-boson, and neutrino 
masses. 

For the f% variable, the shape of the distribution is known analytically [25|. In terms of the 
value x = ]iu and its kinematic endpoint x max , the normalized distribution can be written: 



f = «*<*> + <*-«> a 



4^ ^max 

m , 



(4.4) 



where the parameter a is treated as an empirical quantity to be measured. In practice, oc ~ 0.6, 
and the zero bin of y.n histograms will be suppressed to better show the features of the endpoint 
region. The origin of the delta function is geometric: for massless leptons, fi££ vanishes when 
the two lepton p T ^ vectors lie on opposite sides of the axis defined by the upstream Pj vector, 



and is equal to 2(pj ± Pj±) 1/2 otherwise. 

For a test mass of the child particle m v , the endpoint is related to the masses via IHO 



..max 



M w 




M 2 W 



m w 



+ m( 



(4.5) 



In the tt case, we set the test mass to m v = 0. We then expect the endpoint at ujf x = Mw(l — 
ml/Mw) = M w = 80.4 GeV. Note that m v is the true mass of the child and M w is the true 
parent mass; these should be viewed as variables in a function for which m v is a parameter. 



In a new-physics application, the analogs of Mw and m v are not known; but given Eq. 4.5 the 
measurement of the endpoint, and an arbitrary choice of child mass m v , one can fix a relationship 



between the two unknown masses. We emphasize that the equality expressed by Eq. 4.5 holds 
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regardless of the value of the test mass, because the test mass enters into both sides of the 



equation (see discussion in Section 4.1.1 1. This applies below to Eq. 4.6 also. 



In the case of ^tv the visible particles are the two b jets, the child particles are the charged 
leptons and neutrinos (combined), and ISR radiation generates the upstream transverse mo- 
mentum. We take the visible particle masses to be the observed jet masses, which are typically 
~10GeV. The endpoint is unaffected by nonzero jet masses provided the test mass is set to 
the true mass, and is affected only at the ±0.1 GeV level over a large range of test masses, 
< Mw < 2Mw- For an assumed child mass Mw, the endpoint is given by |TlO"ll2"5l : 

In the tt case, we set the test mass to Mw = Mw = 80.4 GeV. We then expect the endpoint at 
^bt> ax = M t . As in the previous case, in a new-physics application where the analogs of M t and 
Mw are not known, the measurement of the endpoint together with an arbitrary choice of the 
child mass Mw yields a relationship between the two unknown masses. 

As noted above, a third variable is needed, and we adopt M^, the invariant mass formed out 
of jet-lepton pairs emerging from the top-quark decay. Two values of M^ can be computed in 
a tt event, one for each top decay. In practice four are calculated because one does not know 
a priori how to associate the b jets and leptons; we discuss later an algorithm for mitigating 
the combinatorial effects on the endpoint. The shape of the distribution is known for correct 
combinations but is not used here since correct combinations cannot be guaranteed (see Section 
|5.3|. The endpoint is given by: 




KT = \< + 1 - T^ Ew + P* Eg + P*h (4-7) 



where £ w , E^, and p* are energies and momenta of the daughters of t — > bW in the top-quark 
rest frame. In these formulae the charged-lepton mass is neglected but the observed b-jet mass 
ni\, is finite and varies event-to-event. 

We can now summarize the mass measurement strategy. If the masses M t , Mw, and m v were 
unknown, one would measure the two endpoints and the invariant mass that appear on the 



left-hand sides of Eqs. 4.5-4.7 using arbitrary test mass values for the first two, to obtain three 
independent equations for the three unknown masses. Then, in principle, one solves for the 
three masses. In practice, the measurements carry uncertainties and an optimum solution must 
be determined by a fit. In the case when one or more of the masses is known, a constrained fit 
can improve the determination of the remaining unknown mass(es). 



In Fig. 4.2 we show distributions for the three observables }ia, ji\fo, and M\,( . Here and through- 
out this paper, the zero bin of the /% distribution, corresponding to the delta function of Eq. |4.4[ 
is suppressed to emphasize the kinematically interesting component of the shape. In the f/bb 
plot shown here, the prominent peak that dominates the figure is an analog of the delta func- 
tion in y.(j, its substantial width being due to the variable mass of the jets that enter into the f/bb 
calculation. As with the f% delta function, the peak arises from events where the axis of the 
upstream Pj falls between the two visible-object px vectors. In later plots this f/bb peak will be 
suppressed to better reveal the behavior of the distribution in the endpoint region. 

The agreement between data and MC is generally good, but the comparisons are for illustration 
only and the analysis and results that follow do not depend strongly on the MC simulation or 
its agreement with observation. 



5 Backgrounds 
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Figure 4.2: Distributions of the three kinematic distributions ]iu, ]iyo, and M\,(. Data (5.0 fb~ ) 
are shown with error bars. MC simulation is overlaid in solid color to illustrate the approx- 
imate tt signal and background content of the distributions. The backgrounds contained in 
"Other" are listed in Table [T] The zero-bin of the jiu plot is suppressed for clarity. The M\,( 
plot contains multiple entries per event (see Section [53] for details). In all cases, the simulation 
is normalized to an integrated luminosity of 5.0 fb - with next-to-leading-order (NLO) cross 
sections as described in the text. 



5 Backgrounds 

The two-lepton requirement at the core of the event selection ensures an exceptionally clean 
sample. Nevertheless a few types of background must be considered, including top-quark 
decays with r-lepton daughters, pp — > tW events, and sub-percent contributions from other 
sources. 

5.1 Physics Backgrounds 

The physics backgrounds consist of tt decays that do not conform to the dilepton topology of 
interest, as well as non-tt decays. Tablenlshows the estimation of signal and background events 
in MC simulation. The MC generators used throughout this study are MC@NLO 3.41 [27] for all 
tt samples, pythia 6.4 [28] for the diboson samples, and MadGraph 5.1.1.0 l29| for all others. 
The simulated data samples are normalized to 7 TeV NLO cross sections and an integrated 
luminosity of 5.0 fb~ . 

Events in which a top quark decays through a r lepton (e.g. t — > bT + t/ T — > bt + v^v T v T ), consti- 
tute about 13% of the events surviving all selection requirements. From the point of view of 
event selection, these events are background. The unobserved momentum carried by the extra 
neutrinos, however, ensures that these events reconstruct to Mj2 and M\,i values below their 
true values and hence fall below the endpoint of signal events with direct decays to e or ]i final 
states. We therefore include these events among the signal sample. This leaves in principle 
a small distortion to the kinematic shapes, but the distortion is far from the endpoint and its 
impact on the mass extraction is negligible. 

5.2 Modelling the Mistag Background 

In addition to the backgrounds discussed above, which fall within the bulk the distributions, it 
is essential also to treat events that lie beyond the nominal signal endpoint. In this analysis, the 
main source of such events comes from genuine tt events where one of the jets not originating 
from a top-quark decay is mistagged as a b jet. An event in which a light-quark or gluon jet is 
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Table 1: Estimate of signal and background composition in MC simulation, normalized to an 
integrated luminosity of 5.0 fb~ and NLO cross sections as described in the text. 



Process Number of Events 
tt signal (no t) 7000 
tt signal (t -> £v) 1100 



Single top (tW,tW) 


270 


Drell-Yan 


77 


Hadronic/Semileptonic tt 


55 


with misreconstructed lepton(s) 




Dibosons (WW, ZZ, WZ) 


14 


W+jets 


9 



treated as coming from a top quark can result in events beyond the endpoint in the ^bb and M^t 



distributions, as can be seen in Fig. 5.1 The measurement of /%, on the other hand, depends 



primarily on the two leptons and is unaffected by mistags. 

To determine the shape of the mistag background in f<bb and M\,t, we select a control sample 
with one b-tagged jet and one antitagged jet, where the antitagging identifies jets that are more 
likely to be light-quark or gluon jets than b jets. Antitagging uses the same algorithm as com- 
bined secondary vertex algorithm, but selects jets with a low discriminator value to obtain a 
sample dominated by light-quark and gluon jets. We classify event samples by the b-tag values 
of the two selected jets, and identify three samples of interest: a signal sample where both jets 
are b-tagged; a background sample where one jet is b-tagged and the other antitagged; and 
another background sample where both jets are antitagged. Table [2] shows the composition of 
these samples as determined in MC simulation. We select the sample consisting of pairs with 
one tagged and one antitagged jet to be the control sample and use it to determine the shape 
of the background lying beyond the signal endpoint. It contains a significant fraction of signal 
events, 27%, but these all lie below the endpoint and categorizing them as background does 
not change the endpoint fit. 

Table 2: Composition of b-tagged, dijet samples as determined in MC simulation. Each column 
is an independently selected sample; columns sum to 100%. 



2 b-tags b-tag, antitag 2 antitags 



b jet, b jet 86% 27% 7.1% 

bjet,nonbjet 14% 70% 53% 

non b jet, non b jet 0.3% 3% 40% 

The control sample is used to generate distributions in ^b an d M\,f, whose shapes are then 
characterized with an adaptive kernel density estimation (AKDE) method (30| . The underly- 
ing KDE method is a non-parametric shape characterization that uses the actual control sample 
to estimate the probability distribution function (PDF) for the background by summing event- 
by-event Gaussian kernels. In the AKDE algorithm, on the other hand, the Gaussian widths 
depend on the local density of events; empirically this algorithm yields lower bias in the final 



mass determination than alternative algorithms. Figure 5.2 shows the performance of the back- 
ground shape determination; the set of control sample events are taken from MC simulation in 
order to illustrate the composition of the background and signal. 
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Figure 5.1: Composition of MC event samples, illustrating that signal events with light-quark 
and gluon jet contamination dominate the region beyond the endpoint. The left and right M^ 
distributions contain the same information plotted with different vertical scales. The back- 
grounds contained in "Other" are listed in Table IT] 
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Figure 5.2: Background PDF shapes determined by the AKDE method, on MC samples. All 
events pass the signal selection criteria, left: M\,f, right: ^bb- The heavy black curve is the 
AKDE shape. 
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5.3 Suppressing the Combinatorial Background 

Even if the b-tagging algorithm selected only b jets, there would remain a combinatorics prob- 
lem in tt dilepton events. In the case of the M\,( distribution the matching problem arises in 
pairing the b jet to the lepton: for b jets ]\ and 72, and leptons £ + and £~, two pairings are 
possible: j\£ + ,jr£~ and ji£~,]2i + - Four values of M^g will thus be available in every event, 
but only two of them are correct. The two incorrect pairings can (but do not have to) generate 
values of M^ beyond the kinematic endpoint of M\,g in top-quark decay. To minimize the un- 
wanted background of incorrect pairings while maximizing the chance of retaining the highest 
values of M^ in correct b£ pairings, which do respect the endpoint, we employ the following 
algorithm. 

Let A and a denote the two M^ values calculated from one of the two possible b£ pairings, and 
let B and b denote the M^ values calculated from the other pairing. Choose the labeling such 
that a < A and b < B. Without making any assumptions about which pairing is correct, one 
can order the M\,t values from smallest to largest; there are six possible orderings. For example 
the ordering b, B, a, A means that the bB pairing has Mb<? values which are both smaller than 
the M\,c values in the a A pairing. In this case, while we do not know which pairing is correct, 
we can be certain that both M\,f values of the bB pairing must respect the true endpoint since 
either (a) bB is a correct pairing, in which case its M\,i values naturally lie below the endpoint, 
or (b) a A is the correct pairing, so its M\,f values lie below the true endpoint, with the bB values 
falling at yet lower values. Similar arguments apply to each of the other possible orderings. 

Table [3] shows the six possibilities. For each mass ordering shown in the left column, the right 
column shows the mass values that will be selected for use in the M\,( fit. For any given event 
only one row of the table applies. For an event falling in one of the first two rows, two values 
of M\,f enter in the subsequent fits; for an event falling in the last four rows, three values enter 
the fits. 

Table 3: M\,£ orderings: in each column the left-to-right sequencing of the a, A, b, B labels is 
from lowest Mw value to highest. The left column lists the six possible M^ orderings; the right 
column indicates for each ordering which values are selected for inclusion in the M^ plot. 



Ordering 


Selection 


bBaA 


b,B 


aAbB 


a, A 


baBA 


b,a,B 


baAB 


b,a,A 


abBA 


a,b,B 


abAB 


a,b,A 



This selection algorithm ensures that all masses used in the fits that can be guaranteed to be 
below the endpoint will be used, while any that could exceed the endpoint because of wrong 
pairings will be ignored. Note that it does not guarantee that the masses that are used are all 
from correct b£ pairings. 

6 Fit Strategy 

The kinematic observables }iu, ^btv and M\,(, along with their endpoint relations (Section |4.2| 
and background mitigation techniques (Sections |5.2} |5.3[), are combined in an unbinned event- 
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by-event maximum likelihood fit. The likelihood function is given by a product over all events 
of individual event likelihoods defined on each of the kinematic variables: 

N 

£(M) =Yl£f u (ui\M) • £f bb (u,|M) • Cf bt ( Ui \M). (6.1) 

1=1 

The vector M = (M t ,Mw,w5) contains the mass parameters to be determined by the fit, and 
each u, comprises the set of transverse momentum vectors, reconstructed object masses, and 
missing-particle test masses from which the kinematic observables }iu, ji^, and M\,£ of the 
event i are co mput ed. We fit for ml rather than m v because only ml appears in the endpoint 



formulae (Eqs. 4.5 and 4.7 1; we do not constrain m\ to be positive. As will be described more 
fully below, only the endpoint region of each variable is used in the fit. If an event i does not 
fall within the endpoint region of a given variable, the corresponding likelihood component 
(£f ", £f bb , or Cf ") defaults to unity. 



For each observable x € {^£//^bb/^w}/ the likelihood component £,• in Eq. 6.1 can be ex 



pressed in terms of the value of the observable itself, Xj = x(uj), and its kinematic endpoint, 



i max = x max (M). Explicit formulae for x max (M) are given in Eqs. 4.5 4.6 and 4.7 in the first 
two cases there is additional dependence on the missing-particle test mass. Letting the label 
a G {££,bb,b£} index the three flavors of observables, we can write the signal, background, 
and resolution shapes as S(x\x a riax ), B a (x), and TZf(x). While the form of the signal shape S(x) 
is common to all three fits, the background shape B a (x) is specific to each observable and the 
resolution function TZf(x) is specific to both the observable and the individual event. Then 



each function C a appearing on the right-hand side of Eq. 6.1 is given by the general form: 



£?(*,Kax) = fijs(y\x a max )nf(x t - y) dy + (1 - B)B a (x t ). (6.2) 

The fit parameter B determines the relative contribution of signal and mistag background. 

For the common signal shape S(x|x^ ax ) we use an approximation consisting of a kinked-line 
shape, constructed piecewise from a descending straight line in the region just below the end- 
point and a constant zero value above the endpoint. The kinked-line function is defined over a 
range from X\ to x^. The generic form is: 

*{X\X max ) - | Q ^^ < x < Xh ._ (6-3) 

The parameter J\f is fixed by normalization. The fidelity of this first-order approximation to the 
underlying shape depends on both the shape and the value of X\ . The range of the fit, (x\ 0l x^i), 
is chosen to minimize the dependence of the fit results on the range, and then the values of x\ 
and Xhi are subsequently varied to estimate the corresponding systematic uncertainties. 

The following paragraphs discuss the forms of B a (x) and lZ a (x) for each of the three kinematic 
distributions. 

6.1 fl££ 

In the case of ji^, the visible particles are the two leptons, which are well measured. The projec- 
tion of their vectors onto the axis orthogonal to the upstream Pj, however, necessarily involves 
the direction of the upstream Pj, which is not nearly as well determined. The resolution func- 
tion is therefore wholly dominated by the angular uncertainty in Pj, and it varies substantially 
from event to event depending on the particular configuration of jets found in each event. 
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Although jet resolutions are known to have small non-Gaussian tails, their impact on the jiff 
resolution function and the subsequent fit procedure is small and we treat only the Gaussian 
core. A far more important feature of the resolution arises when the Pj direction uncertainty is 
propagated into the fly variable to derive Tlfix). In this procedure a sharp Jacobian peak ap- 
pears wherever the Pj smearing can cause jiff to pass through a local maximum or minimum 
value. These peaks depend only on azimuthal angles and occur at any value of ji^. The de- 
tailed shape of the highly non-Gaussian jiu resolution and its convolution with the underlying 



signal shape, as specified in Eq. 6.2 are handled by exact formulae derived analytically (see 
Appendix). The background in the jiff distribution is vanishingly small, so we set B (x) = 0. 

6.2 ji bb 

For ^t,b/ the visible particles are the b jets, and since the resolution smearing of both the b 
jets and the upstream jets defining Px are large and of comparable magnitudes, the event-by- 
event resolution is more complicated than in the jiff case. As a result, no analytic calculation 
is possible and we instead determine the pi\,\, resolution function, 7?, bb (x), numerically in each 
event, using the known tJj and (p resolution functions for the jets. As with the jiff resolutions, 
Jacobian peaks appear in the u^ resolutions. The mistag background is i nclud ed by scaling the 



shape B bb (x) obtained from the AKDE procedure as discussed in Section 5.2 

6.3 M M 

In the Mb( case, the theoretical shape S(x) is well-known, but the combinatorics of b£ matching, 
together with the method of selecting hi pairs from the available choices (see Section [53] l, sculpt 
the distribution to the degree that the theoretical shape is no longer useful. Therefore we use the 



kinked-line shape of Eq. 6.3 to model the signal near the endpoint. In contrast to the jiff and ^,b 
variables, numerical studies confirm that linearly propagated Gaussian resolutions accurately 
reflect the smearing 1ZY{ X ) of M^f, as one expects in this case. The background shape B M (x) is 



given by the AKDE procedure as discussed in Section 5.2 
6.4 Applying the Fit to Data 



The unbinned likelihood fit prescribed in Eqs. 6.1 and 6.2 is performed on the three kinematic 



distributions using the shapes given for signal S(x\x max ), resolution TZi(x), and mistag back- 
ground B(x). Although a simultaneous fit for all three masses is an important goal of this study, 
it is useful in the context of the tt data sample to explore subclasses of the fit in which some 
masses are constrained to their known values. For this purpose we define: (a) the unconstrained 
fit, in which all three masses are fit simultaneously; (b) the singly-constrained fit, in which m v = 
is imposed; and (c) the doubly-constrained fit, in which both m v = and Mw = 80.4 GeV are im- 
posed pi). The unconstrained fit is well-suited to testing mass measurement techniques for 
new physics, while the doubly-constrained fit is optimal for a SM determination of the top- 
quark mass. 

The fit procedure takes advantage of bootstrapping techniques ||32| . In particle physics, boot- 
strapping is typically encountered in situations involving limited MC samples, but it can be 
profitably applied to a single data sample, as in this analysis. The goal of bootstrapping is to 
obtain the sampling distribution of a statistic of a particular data set from the data set itself. 
With the distribution in hand, related quantities such as the mean and variance of the statistic 
are readily computable. 

In order to estimate the sampling distribution of a statistic, we first need to estimate the distri- 
bution from which the data set was drawn. The basic assumption of bootstrapping is that the 
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6 Fit Strategy 



best estimate for this distribution is given by a normalized sum of delta functions, one for each 
data point. This is the bootstrap distribution. One can estimate the distribution of a statistic 
of the data by drawing samples from the bootstrap distribution and calculating the statistic on 
each sample. To simplify the process further we note that, since the bootstrap distribution is 
composed of a delta function at each data point, sampling from the bootstrap distribution is 
equivalent to sampling from the observed data. 

In this analysis, the fitted top-quark mass is the statistic of interest, and we wish to find its 
mean and standard deviation. To do so, we conduct the fit 200 times, each time extracting a 
new sampling of events from the 8700 selected events in the signal region of the full data set. 
The sampling is done with replacement, which means that each of these bootstrapped pseudo- 
experiments has the same number of events (N = 8700) as the original data set, and any given 
event may appear in the bootstrap sample more than once. Each bootstrapped sample is fit with 
the unbinned likelihood method described in the previous subsections. As an illustration, we 



show in Fig. 6.1 the distribution of the 200 values of Mt that emerges in the case of the doubly- 
constrained fit; the mean and its standard deviation in this distribution, M t = 173.9 ± 0.9 GeV, 
constitute the final result of the doubly-constrained fit. 
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Figure 6.1: Distribution of M t in doubly-constrained fits of 200 pseudo-experiments boot- 
strapped from the full data set. 



A key motivation for applying bootstrapping to the data is that the impact of possible fluctua- 
tions in the background shape are naturally incorporated. Because the background shape in a 



given fit is constructed from a control sample with the AKDE method (Section 5.2 1, the possible 
statistical variation in the shape is most easily accounted for by multiple samplings of the con- 
trol sample. Thus for each bootstrap sample taken from the signal region of the data, another 
is taken simultaneously from the set of background control events. Each pseudo-experiment 
fit therefore has its own background function and the ensemble of all 200 such fits automati- 
cally includes background shape uncertainties. (The total background yield is a separate issue, 
handled by the normalization parameter that floats in each fit.) 

A secondary motivation to use bootstrapping on the data sample is that it offers a convenient 
mechanism to correct for event selection and reconstruction efficiencies [33 1. To do so, each 
event is assigned a sampling weight equal to the inverse of its efficiency, and during the boot- 
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strap process events are selected with probabilities proportional to these weights. A boot- 
strapped data set therefore looks like efficiency-corrected data, but each event is whole and 
unweighted. 

7 Validation 

We test for bias in the above procedures by performing pseudo-experiments on simulated 
events. Each pseudo-experiment yields a measurement and its uncertainty for M t . From these 
a pull can be calculated, defined by pull = (m meas — m gen )/c meas . In this expression m gen is 
the top-quark mass used in generating events while m meas and c meas are the fitted mass and its 
uncertainty, determined for each pseudo-experiment. For an unbiased fit, the pull distribution 
will be a Gaussian of unit width and zero mean. A non-zero mean indicates the method is 
biased, while a non-unit width indicates that the uncertainty is over- or under-estimated. We 
increase the precision with which we determine the pull distribution width by bootstrapping 
the simulation to generate multiple pseudo-experiments. The results of Ref. |34] are then used 
to calculate the mean and standard deviation of the pull distribution, along with uncertain- 
ties on each, taking into account the correlations between pseudo-experiments introduced by 
over-sampling. 



Figure |7.1[ left, shows the pull distribution for the doubly-constrained fit over 150 pseudo- 
experiments. Extracting a result from each pseudo-experiment involves the methods discussed 



in Section 6.4 and thus the total number of pseudo-experiments required for the study is 150 x 
200. The measured pull mean is 0.15 ± 0.19 and the pull standard deviation is 0.92 ± 0.06, 
indicating that the fit is unbiased to the level at which it can be measured with the available 
simulated data. The slightly low standard deviation suggests that the statistical uncertainty 
may be overestimated; since the systematic uncertainty is significantly larger than the statistical 
error, we do not make any correction for this. 

In an independent test, we perform fits to MC samples generated with various M t values. As 



the results, shown in Fig. 7.1 right, indicate that there is no significant bias as a function of the 



top-quark mass, we make no correction. 

8 Systematic Uncertainties 

The systematic uncertainties are assessed by varying the relevant aspects of the fit and re- 
evaluating the result. All experimental systematic uncertainties are estimated in data. In the 
doubly-constrained fit, uncertainties are evaluated for the fitted top-quark mass M t . 

The systematic uncertainties related to absolute jet energy scale (JES) are derived from the cal- 
ibration outlined in Ref. [20]. We evaluate the effects of JES uncertainties in this analysis by 
performing the analysis two additional times: once with the jet energies increased by one stan- 
dard deviation of the JES, and once with them decreased by the same amount. Each jet is varied 
by its own JES uncertainty, which varies with the pj and tj values of the jet. In a generic sample 
of multijet events, selecting jets above 30 GeV, the fractional uncertainty in the JES (averaged 
over tj) ranges from 2.8% at the low end to 1% at high pj. The uncertainty is narrowed further 
by using flavor-specific corrections to b jets. A similar process is carried out for varying the 
jet energy resolutions by its uncertainties. These variations of jet energy scale and jet energy 
resolution propagate into uncertainties of ^ '| GeV and ±0.5 GeV on the measured top-quark 
mass, respectively. 

The choice of fit range in ^bb and M^p introduces an uncertainty due to slight deviations from 
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Figure 7.1: (left) Pull distribution (m meas — in gen 
are fixed) across 150 MC pseudo-experiments, (right) Fit results obtained in MC tt-only sam- 
ples generated with MadGraph for various top-quark masses. The best-fit calibration is 
shown by the solid line and the line of unit slope is shown in the dashed line. Data points 
are from doubly-constrained fits. The line of unit slope agrees with the fit results with 
X 2 /degree of freedom =10.7/9. 

linearity in the descending portion of these distributions. Separately varying the upper and 
lower ends of the ^bb an d M^ fit range gives an estimate of ±0.6 GeV for the systematic uncer- 
tainty. The uncertainty is mainly driven by dependence on the lower end of the ^bb range. A 
cross-check study based on the methods of Ref . |135| confirms the estimate. 

The AKDE shape which is used to model the mistag background in ^,b an d M^e is non- 
parametric and derived from data. For this reason, the AKDE is not subject to biases stemming 
from assumptions about the underlying background shape or those inherent in MC simulation. 
However, one could also model the mistag background with a parametric shape, and we use 
this alternative as a way to estimate the uncertainty due to background modeling. Based on 
comparisons among the default AKDE background shape and several parametric alternatives, 
we assign a systematic uncertainty of ±0.5 GeV. 

Efficiency can affect the results of this analysis if it varies across the region of the endpoint 
in one or more of the kinematic plots. The M\,f observable is sensitive to both b-tagging and 
lepton efficiency variations, whereas ^b i s only sensitive to uncertainties due to b-tagging 
efficiency. By varying the b-tagging and lepton selection efficiencies we estimate that the effect 
of the efficiency uncertainty contributes at most t.0'2 GeV uncertainty to the measured top-quark 
mass. 

The dependence on pileup is estimated by conducting studies of fit performance and results 
with data samples that have been separated into low-, medium-, and high-pileup subsamples 
of equal population. The dependence is found to be negligible. In addition, direct examination 
of the variables /^bb an d M^g reveals that their correlation with the number of primary vertices 
is small, with correlation coefficients < 3% and < 1%, respectively. 

The sensitivity of the result to uncertainties in QCD calculations is evaluated by generating 
simulated event samples with varied levels of color-reconnection to beam remnants, factoriza- 
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tion scale, and jet-parton matching scale. The impact of the variations on M t is dominated by 
the color reconnection effects, which are estimated by comparing the results of simulations per- 
formed with two different MC tunes I36j, PERUGIA2011 and PERUGIA2011NOCR. The overall 
systematic error attributed to QCD uncertainties is ±0.6 GeV on the value of M t . In quadrature 
with other systematic uncertainties these simulation-dependent estimates add 0.1 GeV to both 
the upper and lower systematic uncertainties. This additional contribution reflects theoretical 
uncertainty in the interpretation of the measurement as a top-quark mass, and unlike other 
systematic uncertainties in the measurement, is essentially dependent on the reliability of the 
MC modeling. 

For the unconstrained and singly-constrained fits, where the objective is primarily to demon- 
strate a method, rather than to achieve a precise result, we have limited the investigation of 
systematic uncertainties to just the evaluation of the jet energy scale and fit range variations, 
which are known from the doubly-constrained case to be the dominant systematic contribu- 
tions. Because of this, the systematic uncertainties displayed for these fits are slightly lower 
than they would be with a fuller treatment of all contributions. 

9 Results and Discussion 

The simultaneous fit to the three distributions determines m 2 , Mw, and M t . A complete sum- 
mary of central values and statistical and systematic uncertainties for all three mass constraints 
can be found in Table El Figure 9.1 shows the corresponding fits. 



Table 4: Fit results from the three mass analyses with various mass constraints. Uncertainties 
are statistical (first) and systematic (second). Values in parentheses are constrained in the fit. 
For the neutrino, squared mass is the natural fit variable - see text for discussion. 



Constraint 


Fit quantity 


None 


m v = m v 


= and M w = 80.4 GeV 


m 2 v (GeV 2 ) 


-556 ± 473 ± 622 


(0) 


(0) 


M w (GeV) 


72 ± 7 ± 9 


80.7 ±1.1 ±0.6 


(80.4) 


M t (GeV) 


163 ±10 ±11 


174.0 ± 0.9+^ 


173.9 ± 0.9+^j 



We take the doubly-constrained version to be the final result: 

M t = 173.9 ± 0.9 (statOl^o ( s Y st -) GeV - (9- 1 ) 

In the more general case of the unconstrained measurement, the performance of the endpoint 
method illustrated here in the tt dilepton system suggests the technique will be a viable option 
for mass measurements in a variety of new-physics scenarios. The precision on Mt given by 
the doubly-constrained fit, for example, is indicative of the precision with which we might 
determine the masses of new colored particles (like squarks), as a function of the input test mass 
fh v . Of course, as shown in the second column of Table [4J the input mass m v itself will be 
determined less precisely. Another plausible scenario is one in which new physics mimics the 
leptonic decay of the W boson. This can arise in SUSY with ^-parity violation and a lepton- 
number violating term in the superpotential. In this case, the lightest superpartner could be 
the charged slepton, which decays to a lepton and neutrino, just like the SM W boson. Current 
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Figure 9.1: Results of simultaneous fits to ml, Mw, and Mf. The upper red line is in all cases 
the full fit, while the green (middle) and blue (lowest) curves are for the background and signal 
shapes, respectively. While the fit is performed event-by-event for all measured kinematic 
values, the line shown is an approximate extrapolation of the total fit likelihood function over 
the entire fit range. Top row: unconstrained fit; Middle row: singly-constrained fit; Bottom 
row: doubly-constrained fit. The inset shows a zoom of the tail region in M\,t for the doubly- 
constrained case to illustrate the level of agreement between the background shape and the 
data points. 
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bounds from LEP indicate that the slepton must be heavier than 100 GeV. Given the ~1 GeV 
precision provided by the singly-constrained fit on the W boson mass, the W boson can easily 
be discriminated from such an object based on its mass. 

It is interesting to note also that in the unconstrained case, one can restrict the range of the neu- 
trino mass (which is treated as an unknown parameter) reasonably well, within approximately 
20 GeV, in line with previous expectations 113 7|| . If the E™ ss signal is due to SM neutrinos, rather 
than heavy WIMPs with masses of order 100 GeV, this level of precision is sufficient to distin- 
guish the two cases. If, on the other hand, the E™ ss signal is indeed due to heavy WIMPs, one 
might expect that the precision on the WIMP mass determination will be no worse than what 
is shown here for the neutrino, assuming comparable levels of signal and background. 

10 Conclusions 

A new technique of mass extraction has been applied to tt dilepton events. Motivated pri- 
marily by future application to new-physics scenarios, the technique is based on endpoint 
measurements of new kinematic variables. The three mass parameters ml, Mw, and M t are 
obtained in a simultaneous fit to three endpoints. In an unconstrained fit to the three masses, 
the measurement confirms the utility of the techniques proposed for new-physics mass mea- 
surements. When ml and Mw are constrained to and 80.4 GeV respectively, we find M t = 
173.9 ± 0.9 (stat.)^2 o ( s y s *-) GeV, comparable to other dilepton measurements. This is the first 
measurement of the top-quark mass with an endpoint method. In addition to providing a 
novel approach to a traditional problem, it achieves a precision similar to that found in stan- 
dard methods, and its use lays a foundation for application of similar methods to future studies 
of new physics. 
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A Analytical resolution functions for fi a 

We present the analytical forms of the resolution functions used in the f% fits, together with a 
brief summary of their derivation. 

The leptons used in computing }i^ are approximately massless and therefore the Mj2± variable 
may be written [25| as 

V-ii = 2 ( £ ti±Et2± - Pti± • PT2±) ,. -. 

=2p T1 p T2 [ | sin(0i - <pus) sin(<£ 2 - (p\js) \ ~ sin(0i - (pus) sin(</> 2 - <£us) ] , 

where (pji, (pi) are the transverse coordinates of lepton i G {1, 2} and (pus is the azimuthal angle 
of the upstream momentum in the CMS reference frame. 

If the upstream Pj vector happens to lie between the two lepton vectors pxi and pj2/ so that 
<pi ~ (pus > an d <pi — (pus < (or vice-versa) then the value of p« is identically zero. This 
is the origin of the delta function in Eq. |4.4| It is convenient to measure (pus from the midline 
between the lepton px vectors rather than from the CMS-defined x axis, and hence we define 
O = (pus — \{(pi + (pi)- We also define the separation between the two lepton vectors: A(p = 
(pi ~ (pi- 



Eq. |A.l can now be rewritten as: 



m(<d) = I v 4pTlPT2Sin( °" 2 A ^) sin (° + l A( P) when 1*1 > I i^l; (A2 ) 

[ otherwise. 

To streamline the notation, we have dropped the subscript it. (In any case these remarks apply 
only to calculations on the ££ system.) 

The leptonic observables pj\, pj2, (pi, and cp2 are well-measured compared to the direction of 
the upstream jets, O, and thus the resolution of p(O) in a given event depends only on the O 
resolution, with the leptonic observables treated as fixed parameters. The distribution of O is 
well-approximated by a Gaussian form, with a® <^ n; we ignore small non-Gaussian tails. 



The functional form given in Eq. A.2 is maximal at O = 7i/2, falls to zero on either side at 

O = ±^A(p, and is exactly zero in the neighboring regions [0, 5 — \&(p] and [5 + ^A(p, n]. The 

function is periodic in O with period n, but because of the condition a® <^ n we restrict our 

attention to the interval < O < n. For the non-zero portion of ^(O) there is also an inverse 

function: 

1 



*(M) = 2 COS " 



2 2 

fmax — r 



(A.3) 



2pnPT2 

The inverse function O(p) is double-valued as one value of ]i maps to two values of O located 
symmetrically on either side of rt/2. The maximum value of ]i, here denoted }i max , is the largest 
value p can take on for the given the lepton momentum vectors; it corresponds to O = n/2 
where the lepton bisector is orthogonal to the upstream momentum. It should not be confused 
with the endpoint of the y. distribution, which, in addition to the upstream momentum orien- 
tation O = n/l, also requires extreme lepton kinematics: pjipii maximal and A(p = (leptons 
collinear). 
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Figure A.l: Example resolution functions. The panels show two events with different lepton 
and upstream momentum kinematics, as discussed in the text. The dotted curve is a Gaussian 
with a a given by the linearly propagated uncertainties; and the solid curve is the analytic 
form of the resolution function, given in Eq. ( A.4| . The histogram is created by numerically 
propagating resolutions in the underlying parameters. 



To map the Gaussian PDF G(0|c r <j > ) into a resolution function R\ (ji), we write: 

d<D(» 



RiM = E G (<WK) 



d}i 



(A.4) 



where the sum is over the two branches of the double-valued 0(f/). The derivatives of O(^) 
and f<(0) have simple analytic forms. 

In the region where f/(0) = 0, R(^) is a delta function RqS(}i) whose amplitude Rrj is given by 
the area under G(0|co) in the angular region between the two leptons, Rrj = /_L,/ 2 G(0|(7Vj>) dO. 
Thus the total resolution function is given by 



R(p) = Ro6(n) +0(/O£G(O( F )|cr o ) 



dO(^) 



d}i 



(A.5) 



where ©(/^) is the unit step function transitioning from to 1 at ]l = 0. 



Figure A.l shows two representative cases, showing the range of resolution function behavior 
from Gaussian to sharply peaked. In the latter case the delta function Rrj<S(^) is not plotted. In 
the left panel the O is midway between the extremes ±^Acj) and n/2 and the o~® is relatively 
narrow; in the right panel, O is closer to n/2 and has a large value of cr& that allows smearing 
into the — \&<p < O < ^Acp region. The high bin at —45 GeV in the histogram component of 
the right panel contains events in which the resolution smearing of the upstream momentum 
vector pushed the fin value into the delta function at \igj = 0. In the analytic form, the cor- 
responding feature would be the delta function Rq5{ji); but, as noted above, this has not been 
explicitly drawn. 
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